    A novel Boolean kernels family for categorical data

    Kernel-based classifiers, such as SVMs, are considered state-of-the-art algorithms and are widely used for many classification tasks. However, these methods are hardly interpretable and are therefore often regarded as black-box models. In this paper, we propose a new family of Boolean kernels for categorical data in which features correspond to propositional formulas applied to the input variables. The idea is to create human-readable features that ease the extraction of interpretation rules directly from the embedding space. Experiments on artificial and benchmark datasets show the effectiveness of the proposed family of kernels with respect to established ones, such as the RBF kernel, in terms of classification accuracy.
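
    One concrete instance of this kind of kernel is a conjunctive kernel that counts, for a pair of one-hot encoded categorical examples, how many d-literal conjunctions both examples satisfy. The sketch below is illustrative only and assumes this particular form; the exact kernels proposed in the paper may differ.

```python
import numpy as np
from scipy.special import comb
from sklearn.svm import SVC

def conjunctive_kernel(X, Z, d=2):
    """Count the d-literal conjunctions satisfied by both examples.

    X and Z are binary matrices (e.g. one-hot encoded categorical data);
    the kernel value is C(<x, z>, d), the number of conjunctions of d
    features that are active (== 1) in both inputs. Illustrative sketch,
    not necessarily the paper's exact kernel family.
    """
    shared = X @ Z.T               # features active in both examples
    return comb(shared, d)         # element-wise binomial coefficient

# Toy one-hot encoded categorical data (hypothetical).
X_train = np.array([[1, 0, 1, 0],
                    [0, 1, 1, 0],
                    [1, 0, 0, 1],
                    [0, 1, 0, 1]])
y_train = np.array([1, 1, 0, 0])

K_train = conjunctive_kernel(X_train, X_train, d=2)
clf = SVC(kernel="precomputed").fit(K_train, y_train)

X_test = np.array([[1, 0, 1, 0]])
K_test = conjunctive_kernel(X_test, X_train, d=2)
print(clf.predict(K_test))
```

    Because each kernel value corresponds to an explicit count of propositional conjunctions, the features in the embedding space stay human-readable, which is what enables rule extraction.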

    An introduction to Deep Learning in Natural Language Processing: Models, techniques, and tools

    Natural Language Processing (NLP) is a branch of artificial intelligence concerned with the design and implementation of systems and algorithms able to interact through human language. Thanks to recent advances in deep learning, NLP applications have received an unprecedented boost in performance. In this paper, we present a survey of the application of deep learning techniques in NLP, with a focus on the tasks where deep learning is demonstrating the strongest impact. Additionally, we explore, describe, and review the main resources in NLP research, including software, hardware, and popular corpora. Finally, we emphasize the main limits of deep learning in NLP and current research directions.
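
    As an example of the kind of software resource such a survey covers, the snippet below applies a pre-trained deep learning model to a simple NLP task using the Hugging Face transformers library; the choice of this particular tool and task is an assumption for illustration, not something prescribed by the paper.

```python
from transformers import pipeline

# Load a default pre-trained sentiment-analysis model
# (weights are downloaded on first use).
classifier = pipeline("sentiment-analysis")

print(classifier("Deep learning has given NLP an unprecedented boost in performance."))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```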

    Pre-trained transformers: an empirical comparison

    Pre-trained transformers have rapidly become very popular in the Natural Language Processing (NLP) community, surpassing the previous state of the art in a wide variety of tasks. While their effectiveness is indisputable, these methods are expensive to fine-tune on the target domain due to the large number of hyper-parameters; this aspect significantly affects the model-selection phase and the reliability of the experimental assessment. This paper serves a double purpose: we first describe five popular transformer models and survey their typical use in previous literature, focusing on reproducibility; then, we perform comparisons in a controlled environment over a wide range of NLP tasks. Our analysis reveals that only a minority of recent NLP papers that use pre-trained transformers report multiple runs (20%), standard deviation or statistical significance (10%), and other crucial information, seriously hurting replicability and reproducibility. Through a vast empirical comparison on real-world datasets and benchmarks, we also show how the hyper-parameters and the initial seed impact the results, and we highlight the models' low robustness.
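
    To illustrate the reporting practice this kind of analysis argues for, the sketch below repeats a fine-tuning experiment over several seeds and reports the mean and standard deviation rather than a single score. It is a minimal sketch: fine_tune_and_evaluate is a hypothetical placeholder that only simulates a noisy accuracy, standing in for an actual transformer fine-tuning run.

```python
import random
import numpy as np

def fine_tune_and_evaluate(seed):
    """Placeholder for one fine-tuning run (hypothetical).

    In a real experiment this would fine-tune a pre-trained transformer on
    the target task with the given seed and return the test accuracy; here
    it simulates a noisy result to keep the sketch self-contained.
    """
    rng = random.Random(seed)
    return 0.85 + rng.uniform(-0.02, 0.02)  # simulated accuracy

seeds = [0, 1, 2, 3, 4]
scores = np.array([fine_tune_and_evaluate(s) for s in seeds])

# Report mean and standard deviation across runs instead of a single number.
print(f"accuracy: {scores.mean():.3f} +/- {scores.std(ddof=1):.3f} over {len(seeds)} seeds")
```

    Reporting results this way makes the seed-induced variance visible, which is exactly the information the surveyed papers most often omit.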